Generating emotional speech with a concatenative synthesizer
Authors
Abstract
We describe an attempt to synthesize emotional speech with a concatenative speech synthesizer using a parameter space that covers not only f0, duration and amplitude, but also voice quality parameters, spectral energy distribution, harmonics-to-noise ratio, and articulatory precision. Applying this extended parameter set makes it possible to combine the high segmental quality of concatenative synthesis with the wider range of control settings needed for the synthesis of natural affected speech.
Similar resources
A concatenative Mandarin TTS system without prosody model and prosody modification
This paper proposes a two-step solution for generating natural prosody in TTS, in which no prosody prediction and modification are needed. A large phonetically and prosodically enriched speech corpus has been collected as the unit pool for the synthesizer. A multi-tier non-uniform unit selection scheme is developed to pick up the most suitable segments for concatenation from the unit pool. Fina...
Expressive speech synthesis using a concatenative synthesizer
This paper describes an experiment in synthesizing four emotional states (anger, happiness, sadness and neutral) using a concatenative speech synthesizer. To achieve this, five emotionally (i.e., semantically) unbiased target sentences were prepared. Then, separate speech inventories, comprising the target diphones for each of the above emotions, were recorded. Using the 16 different combinat...
Exploiting improved parameter smoothing within a hybrid concatenative/LPC speech synthesizer
We depict the interpolation strategies for the concatenation of inventory demisyllables in our hybrid concatenative/LPC speech synthesizer. Inventory elements for vowels and nasals are cut in the steady state of the phoneme. Concatenating elements in the synthesis stage requires smoothing of spectral content and energy to avoid annoying discontinuities in these parameters, which is of vital imp...
A Flexible, Scalable Finite-state Transducer Architecture for Corpus-based Concatenative Speech Synthesis
In this paper we describe our work involving the conversion of our phonologically-based synthesizer into a finite-state transducer (FST) representation which can be used for real-time natural-sounding synthesis. We have designed a transducer structure to efficiently perform the common task of unit selection in concatenative speech synthesis. By encapsulating domain-independent concatenative synt...
Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis
This paper explores the perceptual relevance of acoustical correlates of emotional speech by means of speech synthesis. Besides, the research aims at the development of »emotionrules« which enable an optimized speech synthesis system to generate emotional speech. Two investigations using this synthesizer are described: 1) the systematic variation of selected acoustical features to gain a prelim...